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Abstract 

A maximum weighted matching for bipartite graphs G = (A U B, E) can be 
found by using the algorithm of Edmonds and Karp with a Fibonacci Heap and 
a modified Dijkstra in 0(nm + n 2 logn) time where n is the number of nodes and 
m the number of edges. For the case that \A\ — \B\ the number of edges is n 2 and 
therefore the complexity is C(n 3 ). 

In this paper we want to present a simple heuristic method to reduce the number 
of edges of complete bipartite graphs G = (A U B, E) with \A\ = \B\ such that m = 
n log n and therefore the complexity of the maximum weighted matching algorithm 
will be 0(n 2 logn). The weights of all edges in G must be uniformly distributed in 
[0,1]. 
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1 Introduction 

Computing the maximum or minimum weighted matching of bipartite graphs plays an important 
role in practice. Two applications are the image feature matching problem [2] or the assignment 
problem. 

A well known algorithm is due to Edmonds and Karp and dates back to the year 1972. The two 
key ideas behind their network flow algorithm which converts the problem into a network flow 
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problem is a) use the BFS to find augmenting paths and b) find a path that increases the flow 
at most. 

All logarithms in this paper are to the base of 2 and all graphs G = {A U B, E) have the 
following properties: each partition A and B of G consists of n nodes and the weights of the 
edges in E are uniformly distributed over the interval [0,1]. Therefore, the number of edges is 
m = n 2 . To reduce the complexity of the maximum weighted matching algorithm by Edmonds 
and Karp to 0(n 2 logn) we have to reduce the number of edges in E to nlogn. Therefore we 
have to remove n 2 — n log n edges from E. The heuristic presented here will do this in 0(n 2 ) time. 

The paper is organized as follows: in section 2 we describe a simple approach to reduce m to 
nlogn, in section 3 we try to improve the heuristic a bit and in section 4 we present empirical 
results for some graphs. 



2 Heuristic 

The heuristic is based on a randomized algorithm. This algorithm selects edges at random de- 
pending on their weight and builds a new graph G' = (A U B,E') from G where we expect 
\E'\ pa nlogn. Let E = {e\, . . . , e„2}. Then, we can compute the expected number of edges 
which will be selected by the algorithm to be in E' as follows: 

Let Xi be a random variable for 1 < i < n 2 indicating whether an edge e G E also is member of 
E'~ 

1, e; e E' 
0, e t i E' 



Furthermore, define X := X\ + X 2 + . . . + X n i. As we can see this is the number of edges in E' . 
The expected value of X is computed as follows: 

2 2 2 

n n n 

ELY] = ^ELY;] = (! ' Pr I e e E '\ + ° • Pr I e i E '\) = J2 Pr ^ e e E '\ 

i—l i—1 i—1 

where Pr[e e E'] is the probability that e e E' . 

A simple approach to compute the probability is to set Pr[e e E 1 ] = cw(e) where w(e) is the 
weight of the edge e and c a constant. Then, we have to choose c such that 

2 2 

n n 

^Pr[e G E 1 ] = ^cw(ei) = nlogn 
»=i i=l 

holds. This is done as follows: 

^ nlogn logn 
> cw(ei) = nlogn =>■ c = 5 ps 2 

The constant c can be approximated by 2 log n/n because the edges are uniformly distributed 
over the interval [0, 1] and therefore, the value of the denominator is pa n 2 /2. Now, we can set 
the probability that an edge e £ E is selected by the algorithm to 

Pr[e G E'} = 2w(e)^. 

n 

Then, the expected number of edges in E' is nlogn. 
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edges sorted by their weights 




edges 

Figure 1: Graphs when sorting all edges of E by their weights. For uniformly distributed 
weights we expect the upper graph. For weights modified by the heuristic described in 
section 2 we expect the lower graphs. 



3 Modifying weights to improve results 

In this section we want to modify the computation of the probability a bit to try to improve the 
heuristic of the previous section. The idea is not to use a constant factor to modify the weight 
of an edge but a function which computes a new weight w'(e) for all e £ E in the interval [0, 1] 
depending on the weight of e. After applying this function to the weights of all edges of E the 
new weights are not uniformly distributed anymore. Weights of edges near to one will be more 
probable than edges with small weights because we think that these edges are more important 
for a maximum weighted matching. The function is parameterized by a variable k. 

We compute the new weight w'(e) and the probability that the edge e is in E 1 as follows: 

w'(e) — cw k (e) => Pr[e 6 E 1 ] = cw'(e) = cw k (e) 

As we can see for k = 1 this function is equal to the function presented in section 1 and the new 
weights are uniformly distributed. 

The value for the constant c can be computed analog to the method described in section 1. 

n 2 

i=i 

The value of c can be approximated with the following idea. First, we sort all edges of E by their 
new weight and put them into a diagram as shown in Figure ^ With the help of this diagram it 
is easy to see that the denominator of c can be approximated by the area under the appropriate 
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Figure 2: For six graphs the weights of optimal matchings and weights using the heuris- 
tic with different values for the parameter k were computed. For weights which were 
computed by using the heuristic the tolerance to the optimal weight is given. 



graph with the following integral: 



k , i r i. , i j_ 

k+i ' ~tf k r 



With that it follows 



and 



_i L . n 2(fc+l) = 1 n 2 

fc + 1 n 2fe fc + 1 



nlogn nlogrt logn 

— n = \k + 1) 

_ — ti^ n 



Pr[ee£'] « (fc + l)^V(e) 
n 



As we can see the complexity of this heuristic is 0{kn 2 ) because to each edge of E we have to 
compute the new weight which is done by fc — 1 multiplications. Since we use fixed precision for 
all numbers we assume that each multiplication can be performed in 0(1). 

4 Empirical results 

In this section we want to present some empirical results of our heuristic method for random 
graphs with 1000 up to 6000 nodes. For each graph we have computed the exact weight of a 
maximum weighted matching and the weight of a matching computed by our heuristic for several 
values for the parameter fc. For the heuristic the weights are given relative to the exact weight. 
The results are shown in Figure El 

The time required to find a maximum weighted matching is not shown in Figure but for the 
graph with 6000 nodes and fc = 50 the network flow algorithm by Edmonds and Karp runs with 
the heuristic more than 22 times faster than without using the heuristic. Also, we recognized 
that smaller values for k didn't boost the algorithm. 



4 



References 



[1] J. Edmonds and R. M. Karp. Theoretical improvements in algorithmic efficiency for network 
flow problems. J. of the Assoc. for Corn-put. Mach., 19(2):248 - 264, Apr 1972 

[2] Y.Cheng, V. Wu, R. Collins, A. Hanson and E. Riseman. Maximum-weight bipartite match- 
ing technique and its application in image feature matching. In Proc. SPIE Visual Comm. 
and Image Processing, Orlando, FL., 1996 



5 



